Co-evolution of metabolism and protein sequences.

نویسندگان

  • Moritz Schütte
  • Niels Klitgord
  • Daniel Segrè
  • Oliver Ebenhöh
چکیده

The set of chemicals producible and usable by metabolic pathways must have evolved in parallel with the enzymes that catalyze them. One implication of this common historical path should be a correspondence between the innovation steps that gradually added new metabolic reactions to the biosphere-level biochemical toolkit, and the gradual sequence changes that must have slowly shaped the corresponding enzyme structures. However, global signatures of a long-term co-evolution have not been identified. Here we search for such signatures by computing correlations between inter-reaction distances on a metabolic network, and sequence distances of the corresponding enzyme proteins. We perform our calculations using the set of all known metabolic reactions, available from the KEGG database. Reaction-reaction distance on the metabolic network is computed as the length of the shortest path on a projection of the metabolic network, in which nodes are reactions and edges indicate whether two reactions share a common metabolite, after removal of cofactors. Estimating the distance between enzyme sequences in a meaningful way requires some special care: for each enzyme commission (EC) number, we select from KEGG a consensus set of protein sequences using the cluster of orthologous groups of proteins (COG) database. We define the evolutionary distance between protein sequences as an asymmetric transition probability between two enzymes, derived from the corresponding pair-wise BLAST scores. By comparing the distances between sequences to the minimal distances on the metabolic reaction graph, we find a small but statistically significant correlation between the two measures. This suggests that the evolutionary walk in enzyme sequence space has locally mirrored, to some extent, the gradual expansion of metabolism.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Evolutionary Relationship Between Stearoyl-CoA Desaturase (SCD) Protein Sequences Involved in Fatty Acid Metabolism

Background: Stearoyl-CoA desaturase (SCD) is a key enzyme that converts saturated fatty acids (SFAs) to monounsaturated fatty acids (MUFAs) in fat biosynthesis. Despite being crucial for interpreting SCDs’ roles across species, the evolutionary relationship of SCD proteins across species has yet to be elucidated. This study aims to present this evolutionary relationship based on amino aci...

متن کامل

Identification and Functional Prediction of Long Non-Coding RNAs Responsive to Drought stress in Lens culinaris L.

Drought stress is one of the main environmental factors that affects growth and productivity of crop plants, including lentil. In the course of evolution evolution, crucial genetic regulations mediated by non-coding RNAs (ncRNAs) have emerged in plant in response to drought and other abiotic stresses. In the present study, after identifying lncRNAs within the expression profile of lentil, RNA-s...

متن کامل

Structural Characteristics of Stable Folding Intermediates of Yeast Iso-1-Cytochrome-c

Cytochrome-c (cyt-c) is an electron transport protein, and it is present throughout the evolution. More than 280 sequences have been reported in the protein sequence database (www.uniprot.org). Though sequentially diverse, cyt-c has essentially retained its tertiary structure or fold. Thus a vast data set of varied sequences with retention of similar structure and fun...

متن کامل

(مقاله کوتاه) تجزیه فیلوژنی و تکامل مولکولی لپتین

     In the current study, phylogenetic analysis and molecular evolution of the mammalian’s Leptin was investigated. Data was achieved and aligned by searching its genome database, while all examined mammals contained only a single copy of the Leptin. The nucleotide substitution rate of the sequences and molecular evolution of the Leptin were calculated by maximum likelihood and neighbor-joinin...

متن کامل

Characterization of the Full Length Coat Protein Gene of Iranian Grapevine fanleaf virus isolates, genetic variation and phylogenetic analysis

The full-length coat protein gene of Grapevine fanleaf virus (GFLV) isolates from Iran was characterized byreverse transcription polymerase chain reaction (RTPCR) and sequencing. The expected 1515 bp coatprotein (CP) gene amplicon was obtained for 16 isolates out of 89 that were identified by double antibodysandwich enzyme-linked immunesorbent assay (DASELISA) in a population ...

متن کامل

Co-amoxiclav Effects on the Structural and Binding Properties of Human Serum Albumin

Human serum albumin (HSA) is the most abundant plasma protein in the human body. HSA plays an important role in drug transport and metabolism. This protein has a high affinity to a very wide range of materials, including metals such as Cu2+ and Zn2+, fatty acids, amino acids and metabolites such as bilirubin and many drug compounds. In this study, we investigated the effects of co-amoxiclav, as...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Genome informatics. International Conference on Genome Informatics

دوره 22  شماره 

صفحات  -

تاریخ انتشار 2010